Korean Document Classification Using Extended Vector Space Model
نویسندگان
چکیده
منابع مشابه
Document Vector Space Representation Model for Automatic Text Classification
Classification of text documents presents a unique challenge to conventional classification algorithms. Due to the existence of large number of features in the datasets, providing a desired representation for text documents can be seen as another problem. In this paper a simple but effective representation model for text documents to tackle the classification problem is discussed. Two different...
متن کاملText Document Pre-Processing Using the Bayes Formula for Classification Based on the Vector Space Model
This work utilizes the Bayes formula to vectorize a document according to a probability distribution based on keywords reflecting the probable categories that the document may belong to. The Bayes formula gives a range of probabilities to which the document can be assigned according to a pre determined set of topics (categories). Using this probability distribution as the vectors to represent t...
متن کاملEntity-Based Cross-Document Coreferencing Using the Vector Space Model
Cross-document coreference occurs when the same person, place, event, or concept is discussed in more than one text source. Computer recognition of this phenomenon is important because it helps break "the document boundary" by allowing a user to examine information about a particular entity from multiple text sources at the same time. In this paper we describe a cross-document coreference resol...
متن کاملDocument summarisation based on sentence ranking using vector space model
WWW is a repository of large collection of information available in the form of unstructured documents. It is a challenging task to select the documents of interest from such a huge document pool. To fasten the process of document retrieval, text summarization technique is used. Ranking of documents is made based on the summary or the abstract provided by the authors of the document. But it is ...
متن کاملDocument Ranking and the Vector-Space Model
Using several simplifications of the vector-space model for text retrieval queries, the authors seek the optimal balance between processing efficiency and retrieval effectiveness as expressed in relevant document rankings. fficient and effective text retrieval techniques are critical in managing the increasing amount of textual information available in electronic form. Yet text retrieval is a d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The KIPS Transactions:PartB
سال: 2011
ISSN: 1598-284X
DOI: 10.3745/kipstb.2011.18b.2.093